Recognition and Classification of Numerical Entities in Basque

Authors

  • Ander Soraluze
  • Iñaki Alegria
  • Olatz Ansa
  • Olatz Arregi Uriarte
  • Xabier Arregi
Abstract

This paper presents a system based on finite-state technology that recognises and classifies numerical entities in texts written in Basque. The system handles a wide range of entities, such as temporal expressions, numbers associated with units of measurement, and numbers that refer to common nouns. It obtains an F-measure of 86.96% under MUC evaluation and 78.82% under the simple scoring protocol of IREX and CoNLL.
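The gap between the two reported scores comes from how each protocol counts a correct entity. The Python sketch below is only an illustration under simplifying assumptions, not the authors' evaluation code: the exact-match scorer stands in for IREX/CoNLL-style scoring (span and label must both be right), and the MUC-style scorer is a reduced version that credits the text slot (exact span) and the type slot (matching label on an overlapping span) separately. The `Entity` tuple, the function names, and the `DATE`/`MEASURE` labels are hypothetical.

```python
from typing import List, NamedTuple


class Entity(NamedTuple):
    start: int   # index of the first token
    end: int     # index one past the last token
    label: str   # hypothetical class label, e.g. "DATE"


def f_measure(correct: int, n_pred: int, n_gold: int) -> float:
    """Balanced F-measure from raw counts; 0.0 when undefined."""
    if correct == 0 or n_pred == 0 or n_gold == 0:
        return 0.0
    precision = correct / n_pred
    recall = correct / n_gold
    return 2 * precision * recall / (precision + recall)


def exact_match_f(gold: List[Entity], pred: List[Entity]) -> float:
    """Exact-match scoring: an entity counts only if span and label both match."""
    correct = len(set(gold) & set(pred))
    return f_measure(correct, len(pred), len(gold))


def muc_style_f(gold: List[Entity], pred: List[Entity]) -> float:
    """Simplified MUC-style scoring (assumption): two slots per entity.

    TEXT slot: the predicted span equals a gold span.
    TYPE slot: the predicted label equals the label of an overlapping gold span.
    """
    gold_spans = {(g.start, g.end) for g in gold}
    pred_spans = {(p.start, p.end) for p in pred}
    text_correct = len(gold_spans & pred_spans)

    type_correct = 0
    used = set()  # each gold entity may be matched at most once
    for p in pred:
        for i, g in enumerate(gold):
            overlaps = p.start < g.end and g.start < p.end
            if i not in used and overlaps and p.label == g.label:
                used.add(i)
                type_correct += 1
                break

    return f_measure(text_correct + type_correct, 2 * len(pred), 2 * len(gold))


# Toy data: the second prediction has the right label but a wrong right boundary.
gold = [Entity(0, 2, "DATE"), Entity(5, 7, "MEASURE")]
pred = [Entity(0, 2, "DATE"), Entity(5, 8, "MEASURE")]

print(exact_match_f(gold, pred))
print(muc_style_f(gold, pred))
```

On this toy input the exact-match score is 0.5 and the simplified MUC-style score is 0.75, because the boundary error still earns credit on the type slot. The same effect is consistent with a system scoring higher under MUC than under the stricter protocol.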

Similar resources

Effect of sound classification by neural networks in the recognition of human hearing

In this paper, we focus on two basic issues: (a) the classification of sound by neural networks based on frequency and sound intensity parameters, and (b) evaluating the health of different human ears as compared to those of a healthy person. Sound classification by a specific feed-forward neural network with two inputs (frequency and sound intensity) and two hidden layers is proposed. This process...

Named Entity Recognition and Classification for texts in Basque

This paper presents a system for Named Entity (NE) recognition in written Basque to be used in a CLIR application. Being an agglutinative language, Basque has highly inflected forms, so a prior linguistic preprocessing step is required. The tool we present relies on a combined method that carries out the identification and recognition of entity names in two successive steps. First, a grammar based o...

Question Generation Based on Numerical Entities in Basque

This article presents a question generation (QG) system which is integrated within an automatic exercise generation system. The QG system deals with the Basque language, and target selection is restricted to numerical entities. We present an experiment conducted on a specialised corpus on science and technology, in which the system was evaluated both manually and automatically.

Named Entities Translation Based On Comparable Corpora

In this paper we present a system for translating named entities from Basque to Spanish based on comparable corpora. For that purpose we have tried two approaches: one based on Basque linguistic features, and a language-independent tool. For both tools we have used Basque-Spanish comparable corpora, a bilingual dictionary and the web as resources.

Improvement of Chemical Named Entity Recognition through Sentence-based Random Under-sampling and Classifier Combination

Chemical Named Entity Recognition (NER) is the basic step for subsequent information extraction tasks such as named entity resolution, drug-drug interaction discovery, and extraction of the names of molecules and their properties. Improvement in the performance of such systems may affect the quality of the subsequent tasks. Chemical text from which data for named entity recognition is extracte...

Publication date: 2011